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Statistics of Galaxy Clustering 
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Abstract 

In this introductory talk we will establish connections between 
the statistical analysis of galaxy clustering in cosmology and recent 
work in mainstream spatial statistics. The lecture will review the 
methods of spatial statistics used by both sets of scholars, having 
in mind the cross-fertilizing purpose of the meeting series. Special 
topics will be: description of the galaxy samples, selection effects 
and biases, correlation functions, nearest neighbor distances, void 
probability functions, Fourier analysis, and structure statistics. 



1.1 Introduction 

One of the most important motivations of these series of conferences is to 
promote vigorous interaction between statisticians and astronomers. The 
organizers merit our admiration for bringing together such a stellar cast of 
colleagues from both fields. In this third edition, one of the central subjects 
is cosmology, and in particular, statistical analysis of the large-scale struc- 
ture in the universe. There is a reason for that — the rapid increase of the 
amount and quality of the available observational data on the galaxy dis- 
tribution (also on clusters of galaxies and quasars) and on the temperature 
fluctuations of the microwave background radiation. 

These are the two fossils of the early universe on which cosmology, a sci- 
ence driven by observations, relies. Here we will focus on one of them — the 
galaxy distribution. First we briefly review the redshift surveys, how they 



^Observatori Astronomic, Universitat de Valencia, Burjassot, 46100 Valencia, Spain, 
e-mail: vicent.martinez@uv.es 

■^Tartu Observatoorium, Toravere, 61602, Estonia, e-mail: saar@aai.ee 



2 



Vicent J. Martmez and Enn Saar 



are built and how to extract statistically analyzable samples from them, 
considering selection effects and biases. Most of the statistical analysis of 
the galaxy distribution are based on second order methods (correlation 
functions and power spectra). We comment them, providing the connection 
between statistics and estimators used in cosmology and in spatial statis- 
tics. Special attention is devoted to the analysis of clustering in Fourier 
space, with new techniques for estimating the power spectrum, which are 
becoming increasingly popular in cosmology. We show also the results of 
applying these second-order methods to recent galaxy redshift surveys. 

Fractal analysis has become very popular as a consequence of the scale- 
invariance of the galaxy distribution at small scales, reflected in the power- 
law shape of the two-point correlation function. We discuss here some of 
these methods and the results of their application to the observations, 
supporting a gradual transition from a small-scale fractal regime to large- 
scale homogeneity. The concept of lacunarity is illustrated with some detail. 

We end by briefly reviewing some of the alternative measures of point 
statistics and structure functions applied thus far to the galaxy distribu- 
tion: void probability functions, counts-in-cells, nearest neighbor distances, 
genus, and Minkowski functionals. 



1.2 Cosmological datasets 

Cosmological datasets differ in several respects from those usually stud- 
ied in spatial statistics. The point sets in cosmology (galaxy and cluster 
surveys) bear the imprint of the observational methods used to obtain 
them. 

The main difference is the systematically variable intensity (mean den- 
sity) of cosmological surveys. These surveys are usually magnitude-limited, 
meaning that all objects, which are brighter than a pre-determined limit, 
are observed in a selected region of the sky. This limit is mainly determined 
by the telescope and other instruments used for the program. Apparent 
magnitude, used to describe the limit, is a logarithmic measure of the 
observed radiation flux. 

It is usually assumed that galaxies at all distances have the same (uni- 
versal) luminosity distribution function. This assumption has been tested 
and found to be in satisfying accordance with observations. As the observed 
flux from a galaxy is inversely proportional to the square of its distance, we 
can see at larger distances only a bright fraction of all galaxies. This leads 
directly to the mean density of galaxies that depends on their distance from 
us r. 

This behaviour is quantified by a selection function ^{r), which is usu- 
ally found by estimating first the luminosity distribution of galaxies (the 
luminosity function). 
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One can also select a distance limit, find the minimum luminosity of 
a galaxy, which can yet be seen at that distance, and ignore all galaxies 
that are less luminous. Such samples are called volume-limited. They are 
used for some special studies (typically for counts-in-cells), but the loss of 
hard-earned information is enormous. The number of galaxies in volume- 
limited samples is several times smaller than in the parent magnitude- 
limited samples. This will also increase the shot (discreteness) noise. 

In addition to the radial selection function 4>{r), galaxy samples also are 
frequently subject to angular selection. This is due to our position in the 
Galaxy — we are located in a dusty plane of the Galaxy, and the window in 
which we see the Universe, also is dusty. This dust absorbs part of galaxies' 
light, and makes the real brightness limit of a survey dependent on the 
amount of dust in a particular line-of-sight. This effect has been described 
by a (j){b) ^ (sin5)~^ law (6 is the galactic latitude); in reality the dust 
absorption in the Galaxy is rather inhomogeneous. There are good maps of 
the amount of Galactic dust in the sky, the latest maps have been obtained 
using the COBE and IRAS sateUite data ( [Schlegel et~aL||l998D . 

Edge problems, which usually affect estimators in spatial statistics, also 
are different for cosmological samples. The decrease of the mean density 
towards the sample borders alleviates these problems. Of course, if we se- 
lect a volume-limited sample, we select also all these troubles (and larger 
shot noise). From the other side, edge effects are made more prominent by 
the usual observing strategies, when surveys are conducted in well-defined 
regions in the sky. Thus, edge problems are only partly alleviated; maybe 
it will pay to taper our samples at the side borders, too? 

Some of the cosmological surveys have naturally soft borders. These are 
the all-sky surveys; the best known is the IRAS infrared survey, dust is 
almost transparent in infrared light. The corresponding redshift survey is 
the PSCz survey, which covers about 85% of the sky ( ^aunders et al. 2000). 
A special follow-up survey is in progress to fill in the remaining Galactic 
Zone-of-Avoidance region, and meanwhile numerical methods have been 
developed to interpolate the structures seen in t he survey into the gap 
( ^chmoldt erall|l999| , [Saunders fc Ballingei] |2000| ) . 

Another peculiarity of galaxy surveys is that we can measure exactly only 
the direction to the galaxy (its position in the sky), but not its distance. 
We measure the radial velocity (or redshift z = Vr/c, c is the velocity of 
light) of a galaxy, which is a sum of the Hubble expansion, proportional to 
the distance d, and the dynamical velocity Vp of the galaxy, Vr = Hod + Vp. 
Thus we are differentiating between redshift space, if the distances simply 
are determined as d = Vr/Ho, and real space. The real space positions 
of galaxies could be calculated if we exactly knew the peculiar velocities 
of galaxies; we do not. The velocity distortions can be severe; well-known 
features of redshift space are fingers-of-God, elongated structures that are 
caused by a large radial velocity dispersion in massive clusters of galaxies. 
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The velocity distortions expand a cluster in redshift space in the radial 
direction five-ten times. 

For large-scale structures the situation is different, redshift distortions 
compress them. This is due to the continuing gravitational growth of 
structures. These differences can best be seen by comparing the results 
of numerical simulations, where we know also the real-space situation, in 
redshift space and in real space. 

The last specific feature of the cosmology datasets is their size. Up to 
recent years most of the datasets have been rather small, of the order of 
10'^ objects; exceptions exist, but these are recent. Such a small number of 
points gives a very sparse coverage of three-dimensional survey volumes, 
and shot noise has been a severe problem. 

This situation is about to change, swinging to the other extreme; the 
membership of new redshift surveys already is measured in terms of 10^ 
(160,000 for the 2dF survey, quarter of a million planned) and million- 
galaxy surveys are on their way (the Sloan Survey). More information about 
these surveys can be found in their Web pages: http://www.mso.anu.edu.au/ 
2dFGRS/ for the 2dF survey and http://www.sdss.org/ for the Sloan sur- 
vey. This huge amount of data will force us to change the statistical methods 
we use. Nevertheless, the deepest surveys (e.g., distant galaxy cluster sur- 
veys) will always be sparse, so discovering small signals from shot-noise 
dominated data will remain a necessary art. 



1.3 Correlation analysis 

There are several related quantities that are second-order characteristics 
used to quantify clustering of the galaxy distribution in real or redshift 
space. The most popular one in cosmology is the two-point correlation 
function, ^(r). The infinitesimal interpretation of this quantity reads as 
follows: 

dPi2=n^[l + ^{r)]dVidV2 (1.1) 

is the joint probability that in each one of the two infinitesimal volumes 
dVi and dV2, with separation vector r, lies a galaxy. Here n is the mean 
number density (intensity) . Assuming that the galaxy distribution is a ho- 
mogeneous (invariant under translations) and isotropic (invariant under 
rotations) point process, this probability depends only on r = |r|. In spatial 
statistics, other functions related with ^(r) are commonly used: 

A2(r) =n2e(r) + l, .g(r) = 1 + ^(r), r(r) = n(f(r) + 1), (1.2) 



where A2(r) is the second-order intensity function, g{r) is the pair cor- 
relation function, also called the radial distribution function or structure 
function, and T(r) is the conditional density proposed by Pictronero (1987). 
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Different estimators of ^(r) have been proposed so far in the literature, 
both in cosmology and in spatial statistics. The main differences are in 
correction for edge effects. Comparison of their performance can be found 
in several papers (Pons-Bordcrfa ct al. 1999, Kcrschcr ct al. pOOO , 3toyan 
& StoyaE 2000). There is clear evidence that ^(r) is well described by a 
power-law at scales 0.1 < r < 10 h^^ Mpc where h is the Hubble constant 
in units of 100 km s^^ Mpc^^: 



with 7 ~ 1.8 and tq ~ 5.4 ft, ^ Mpc. This scaling behavior is one of the 
reasons that have lead some astronomers to describe the galaxy distribution 



as fractal. A power-law fit for g{r) oc 



permits to define the correlation 



dimension The extent of the fractal regime is still a matter of debate 
in cosmology, but it seems clear that the available data on redshift surveys 
indicate a gradual transit ion to homogeneity for scales larger than 15-20 
h^^ Mpc ( Martinez 199S| ). Moreover, in a fractal point distribution, the 
correlation length rp increases with the radius of the sample because the 
mean density decreases ( [Pietronero 1987). This simple prediction of the 
fractal interpretation is not supported by the data, instead vq remains 



constant for volume-limited samples with increasing depth ( Martinez et al 
200T| ). 

Several versions of the volume integral of the correlation function are also 
frequently used in the analysis of galaxy clustering. The most extended one 
in spatial statistics is the so-called Ripley i^-function 



K{r) 



47rs^{l + ^{s))ds 



(1.3) 



although in cosmology it is more frequent to use an expression which pro- 
vides directly the average number of neighbors an arbitrarily chosen galaxy 
has within a distance r, N{< r) = nK{r) or the average conditional density 



r 



T{s)s^ds 



Again a whole collection of estimators are used to properly evaluate 
these quantities. Pietronero and coworkers recommend to use only minus- 
estimators to avoid any assumption regarding the homogeneity of the 
process. In these estimators, averages of the number of neighbors within a 
given distance are taken only considering as centers these galaxies whose 
distances to the border are larger than r. However, caution has to be ex- 
ercised with this procedure, because at large scales only a small number of 
centers remain, and thus the variance of the estimator increases. 

Integral quantities are less noisy than the corresponding differential ex- 
pressions, but obviously they do contain less information on the clustering 
process due the fact that values of K{ri) and K{r2) for two different scales 
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ri and r2 are more strongly correlated than values of ^(^i) and C(''2)- Scal- 
ing of A^(< r) oc r^'^ provides a smoother estimation of the correlation 
dimension. If scaling is detected for partition sums defined by the moments 
of order q of the number of neighbors 



N 



1=1 



oc r 



the exponents D g are the so-called generalized or multifractal dimensions 
(Martinez et al. 1990). Note that for q = 2, Z{2,r) is an estimator of 



N{< r) and therefore Dq for q = 2 is simply the correlation dimension. If 
different kinds of cosmic objects are identified as peaks of the continuous 
matter density field at different thresholds, we can study the correlation 
dimension associated to each kind of object. The multiscaling approach 
( Icnscn ct al.| |199l| ) associated to the multifractal formalism provides a 



unified framework to analyze this variation. It has been shown (Martinez 
1995| ) that the value of D2 corresponding to rich galaxy clusters 



et al 



(high peaks of the density field) is smaller than the value corresponding 
to galaxies (within the same scale range) as prescribed in the multiscaling 
approach. 

Finally we want to consider the role of lacunarity in the description 
of the galaxy clustering ( Martinez fc Saar 2001 ). In Fig. 1.1, we show 
the space distribution of galaxies within one slice of the Las Campanas 
redshift survey, together with a fractal pattern generated by means of a 
Rayleigh-Levy flight ( Mandelbrot 1982 ). Both have the same mass-radius 
dimension, defined as the exponent of the power- law that fits the variation 
of mass within concentric spheres centered at the observer position. 

M{R)^FR"". (1.4) 

The best fitted value for bo th p oint distributions is Dm 2± 1.6 as shown in 
the left bottom panel of Fig. |l.l[ The different appearance of bo th point dis- 
tr ibutions is a c onseq uence of the different degree of lacunarity. Blumcnfcld 
MandelbrotI ( |l997|) have proposed to quantify this effect by measuring 



the variability of the prefactor F in Eq. |l.4] , 

E{{F-Ff} 



$ = 



The result of applying this lacunarity measure is shown in the right bottom 
panel of Fig. The visual differences between the point distributions are 
now well reflected in this curve. 



1.4 Power spectra 

The current statistical model for the main cosmological fields (density, ve- 
locity, gravitational potential) is the Gaussian random field. This field is 
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Figure 1.1. Comparison of a Las Campanas survey slice (upper left panel) with 
the Rayleigh-Levy flight model (upper right panel). The fractal dimensions of 
both distributions coincide, as shown by the In M~ln R curves in the lower left 
panel, but the lacunarity curves (in the lower right panel) differ considerably. 
The s olid lines describe the g alaxy distribution, dotted lines - the model results. 



From 



Martinez fc Saai (2001). 



determined either by its correlation function or by its spectral density, and 
one of the main goals of spatial statistics in cosmology is to estimate those 
two functions. 

In recent years the power spectrum has attracted more attention than 
the correlation function. There are at least two reasons for that — the 
power spectrum is more intuitive physically, separating processes on differ- 
ent scales, and the model predictions are made in terms of power spectra. 
Statistically, the advantage is that the power spectrum amplitudes for 
different wavenumbers are statistically orthogonal: 

E {5(k)r(k')} = {27rfSD{k - k')P(k). 

Here (5(k) is the Fourier amplitude of the overdensity field 6 = {p — p)/ p at a, 
wavenumber k, p is the matter density, a star denotes complex conjugation. 
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E{} denotes expectation values over realizations of the random field, and 
5d{^) is the three-dimensional Dirac delta function. The power spectrum 
P(k) is the Fourier transform of the correlation function ^(r) of the field. 

Estimation of power spectra from observations is a rather difficult task. 
Up to now the problem has been in the scarcity of data; in the near future 
there will be the opposite problem of managing huge data sets. The de- 
velopment of statistical techniques here has been motivated largely by the 
analysis of CMB power spectra, where better data were obtained first, and 
has been parallel to that recently. 

The first methods developed to estimate the power spectra were di- 
rect methods — a suitable statistic was chosen and determined from 



observations. A good reference is Feldman et al. (1994) 



The observed samples can be modeled by an inhomogeneous point 
process (a Gaussian Cox process) of number density n(x): 



i(x) = (5_d(x - Xt 



where 6jj{x) is the Dirac delta- function. As galaxy samples frequently have 
systematic density trends caused by selection effects, we have to write the 
estimator of the density contrast in a sample as 

6 d{x - X t) 
fi(xi) 



^(x) = ^ ""^^^^ ^"-^ - 1, 



where n(x) ^ p(x) is the selection function expressed in the number density 
of objects. 

The estimator for a Fourier amplitude (for a finite set of frequencies k^) 

is 



n(xj) 

where is a weight function that can be selected at will. The raw 

estimator for the spectrum is 

PR{k,) = i^(k,)F'^(k,), 

and its expectation value 

E {mk.)\')} = f G{k.-k')P{k')^+ f ^d'x, 
J (Stt)-^ Jy n(x) 

where G{k) — \ip(k)\'^ is the window function that also depends on the 
geometry of the sample volume. Symbolically, we can get the estimate of 
the power spectra P by inverting the integral equation 

G(g>P = Pr~ N, 

where (8> denotes convolution, Pr is the raw estimate of power, and N is 
the (constant) shot noise term. 
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In general, we have to deconvolve the noise-corrected raw power to get 
the estimate of the power spectrum. This introduces correlations in the 
estimated amplitudes, so these are not statistically orthogonal any more. 
A sample of a characteristic spatial size L creates a window function of 
width of A/c « correlating estimates of spectra at that wavenumber 
interval. 

As the cosmological spectra are usually assumed to be isotropic, the 
standard method to estimate the spectrum involves an additional step of 
averaging the estimates P{k) over a spherical shell k £ [fci, fci+i] of thick- 
ness /ci+i — ki > Afc = 1/L in wavenumber space . The minimum- variance 
requirement gives the FKP (Feldman ct al. 1994) weight function: 



V'(x) 



n(x) 



l + n(x)P(fc)- 



and the variance is 



2 



where TV is the number of coherence volumes in the shell. The number 
of independent volumes is twice as small (the density field is real). The 
coherence volume is Vc{k) « (Afc)^ « ss 

As the data sets get large, straight application of direct methods (espe- 
cially the error analysis) becomes difficult. There are different recipes that 
have been developed with the future data sets in mind. A good review of 
these methods is given in Tegmark et al. (1998). 

The deeper the galaxy sample, the smaller the coherence volume, the 
larger the spectral resolution and the larger the wavenumber interval 
where the power spectrum can be estimated. The deepest redshift surveys 
presently available are the P SCz galaxy red s hift s urvey (15411 redshifts 
up to about 400/i~^ Mpc, see [Saunders et alj ( |2000|) ), the Ab ell/ACO rich 
galaxy clus ter survey, 637 redshifts up to about 300 Mpc ( Miller fc Ba- 
tuski |2001 )), and th e ongoing 2dF galax y redshift survey (141400 redshifts 



up to 750/i ^ Mpc ( p^eacock et al. 2001)). The estimates of power spectra 



for the two latter samples have been obtained by the direct method ( Miller 



et al. 2001 , Percival et al. 2001). Fig. 1.2 shows the power spectrum for the 



2dF survey. 

The covariance matrix of the power spectrum estimates in Fig. |l.2| was 
found from simulations of a matching Gaussian Cox process in the sample 
volume. The main new feature in the spectra, obtained for the new deep 
samples, is the emergence of details (wiggles) in the power spectrum. While 
sometime ago the main problem was to estimate the mean behaviour of the 
spectrum and to find its maximum, now the data enables us to see and study 
the details of the spectrum. These details have been interpreted as traces 
of acoustic oscillations in the post-recombination power spectrum. Similar 
oscillations are predicted for the cosmic microwave background radiation 
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Figure 1.2. Power spectrum of the 2dF redshift survey, divided by a smooth model 

power spectrum. The spectrum is not deconvolved. Error bars are determined 
from Gaussian realizations; the dotted lines show the wavermmber region that is 
free of the influence of the window function and of the radial velocity distortions 
and nonlinear effects. (Courtesy of W. J. Percival and the 2dE galaxy redshift 
survey team.) 

fluctuation spectrum. The CMB wiggles match the theory rather well, but 
the galaxy wiggles do not, yet. 

Thus, the measurement of the power spectrum of the galaxy distribution 
is passing from the determination of its overall behaviour to the discovery 
and interpretation of spectral details. 



1.5 Other clustering measures 

To end this review we briefly mention other measures used to describe the 
galaxy distribution. 

1.5.1 Counts-in- cells and void probability function 

The probability that a randomly placed sphere of radius r contains exactly 
N galaxies is denoted by P{N,r). In particular, for N = 0, P(0, r) is the 

so-called void probability function, related with the empty space function 
or contact distribution function F{r), more frequently used in the field of 
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spatial statistics, by F{r) = 1 — P(0, r). The moments of the counts-in-cells 



probabiUties can be related both with the multifractal analysis (Borgani 



1993) and with the higher order 7i-point correlation functions (White 1979, 



gtoyan et al. 1995, Szapudi ct al. 1999). 



1.5.2 Nearest-neighbor distributions 

In spatial statistics, different quantities based on distances to nearest neigh- 
bors have been introduced to describe the statistical properties of point 
processes. G{r) is the distribution function of the distance r of a given 
point to its nearest neighbor. It is interesting to note that F{r) is just the 
distribution function of the distance r from an arbitrarily chosen point in 
IR"^ — not being an event of the point process — to a point of the point 
process (a galaxy in the sample in our case). The quotient 

1 - G{r) 



J{r) 



1 - F{r) 



introduced by van Lieshout fc Baddeley| ( p^ ) is a powerful tool to analyze 



point patterns and has discriminative power to compare the results of A'^- 
body models for stru cture formation with the real distribution of galaxies 



(Kerscher et al. 1999) 



1.5.3 Topology 

One very popular tool for analysis of the galaxy distribution is the genus 
of the isodensity surfaces. To define this quantity, the point process is 
smoothed to obtain a continuous density field, the intensity function, by 
means of a kernel estimator for a given bandwidth. Then we consider the 
fraction of the volume / which encompasses those regions having density 
exceeding a given threshold pt- The boundary of these regions specifies an 
isodensity surface. The genus G{S) of a surface S is basically the number of 
holes minus the number of isolated regions plus 1. The genus curve shows 
the variation of G{S) with / or pt for a given window radius of the ker- 
nel function. An analytical expression for this curve is known for Gaussian 
density fields. It seems that the empirical curve calculated from the galaxy 



catalogs ca n be reasonably well fitted to a Gaussian genus curve ( Canavezes 



3t al. 199^) for window radii varying within a large range of scales 



1.5.4 Minkowski junctionals 

A very elegant generalization of the previous analysis to a larger family 
of morphological characteristics of the point processes is provided by the 
Minkowski functionals. These scalar quantities are useful to study the shape 
and connectivity of a union of convex bodies. They are well known in spatial 
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statistics and have been introduced in cosmology by Mecke et al. (1994) 



On a clustered point process, Minkowski functionals are calculated by gen- 
eralizing the Boolean grain model into the so-called germ-grain model. This 
coverage process consists in considering the sets Ar — ufLj^Br{xi) for the 
diagnostic parameter r, where {'x.i}iLi represents the galaxy positions and 
i3r(xi) is a ball of radius r centered at point x^. Minkowski functionals 
are applied to sets Ar when r varies. In IR^ there are four functionals: the 
volume V, the surface area A, the integral mean curvature H, and the 
Euler-Poincare characteristic x, related with the genus of the boundary of 
Ar hy X = ^ — G. Application of Minkowski funct i onals to the galaxy clus- 

J 1997 ). Th e se qu antities 
([l99^). 



ter distribution can be found in Kerscher et al 



have been used also as efficient shape finders by 



Sahni et al 



Acknowledgments: This work was supported by the Spanish MCyT project 
AYA2000-2045 and by the Estonian Science Foundation under grant 2882. 
Enn Saar is grateful for the invited professor position funded by the 
Vicerrectorado de Investigacion de la Universitat de Valencia. 



References 

Blumenfeld R & Mandelbrot B 1997 Phys. Rev. E 56, 112-118. 
Borgani S 1993 Mon. Not. R. Astr. Soc. 260, 537-549. 

Canavezes A, Springel V, Oliver S J, Rowan- Robinson M, Keeble O, White 
S D M, Saunders W, Efstathiou G, Frenk C S, McMahon R G, Maddox 
S, Sutherland W & Tadros H 1998 Mon. Not. R. Astr. Soc. 297, 777-793. 

Feldman H A, Kaiser N & Peacock J A 1994 Astrophys. J. 426, 23-37. 

Jensen M H, Paladin G & Vulpiani A 1991 Phys. Rev. Lett. 67, 208-211. 

Kerscher M, Pons-Borderia M, Schmalzing J, Trasarti-Battistoni R, Buchert T, 
Martmez V J & Valdarnini R 1999 Astrophys. J. 513, 543-548. 

Kerscher M, Schmalzing J, Retzlaff J, Borgani S, Buchert T, Gottlober S, MuUer 
V, Plionis M & Wagner H 1997 Mon. Not. R. Astr. Soc. 284, 73-84. 

Kerscher M, Szapudi I & Szalay A S 2000 Astrophys. J. 535, L13-L16. 
Mandelbrot B B 1982 The fractal geometry of nature W.H. Freeman San 
Francisco. 

Martmez V J 1999 Science 284, 445-446. 

Martmez V J, Jones B J T, Dommguez-Tenreiro R & van de Weygaert R 1990 

Astrophys. J. 357, 50-61. 
Martmez V J, Lopez-Martf B & Pons-Borden'a M J 2001 Astrophys. J. 554, L5- 

L8. 

Martmez V J, Paredes S, Borgani S & Coles P 1995 Science 269, 1245-1247. 
Martmez V J & Saar E 2001 Statistics of the Galaxy Distribution Chapman and 
Hall/CRC Press Boca Raton. 



1. Statistics of Galaxy Clustering 13 



Mecke K R, Buchert T & Wagner H 1994 Astron. Astrophys. 288, 697-704. 
Miller C J & Batuski D J 2001 Astrophys. J. 551, 635-642. 



Miller C J, Nichol R C & Batuski D J 2001 . |astro-ph/0 1 03018] , submitted to 
Astrophys. J. 

Peacock J A, Cole S, Norberg P, Baugh C M, Bland-Hawthorn J, Bridges T, 
Cannon R D, CoUess M, Collins C, Couch W, Dalton G, Deeley K, Propris 
R D, Driver S P, Efstathiou G, Ellis R S, Frenk C S, Glazebrook K, Jackson 
C, Lahav O, Lewis 1, Lumsden S, Maddox S, Percival W J, Peterson B A, 
Price I, Sutherland W & Taylor K 2001 Nature 410, 169-173. 

Percival W J, Baugh C M, Bland-Hawthorn J, Bridges T, Cannon R, Cole S, 
CoUess M, Collins C, Couch W, Dalton G, Propris R D, Driver S P, Efs- 
tathiou G, Ellis R S, Frenk C S, Glazebrook K, Jackson C, Lahav O, Lewis 
1, Lumsden S, Maddox S, Mood y S, Norberg P, Pe acock J A, Peterson B A, 
Sutherland W & Taylor K 2001. |a,stro-ph/0105252[ submitted to Mon. Not. 
R. Astr. Soc. 

Pietronero L 1987 Physica A 144, 257. 

Pons-Borden'a M J, Martinez V J, Stoyan D, Stoyan H & Saar E 1999 Astrophys. 
J. 523, 480-491. 

Sahni V, Sathyaprakash B S & Shandarin S F 1998 Astrophys. J. 495, L5-L8. 

Saunders W & Ballinger B E 2000 m R. C Kraan-Korteweg, P. A Ken- 
ning & H Andernach, eds, 'The Hidden Universe, A SP Conference Ser ies' 
Astronomical Society of the Pacific, San Francisco. ^stro-ph/0005606 , in 
press. 

Saunders W, Sutherland W J, Maddox S J, Keeble O, Oliver S J, Rowan- Robinson 
M, McMahon R G, Efstathiou G P, Tadros H, White S D M, Frenk C S, 
Carraminana A & Hawkins MRS 2000 Mon. Not. R. Astr. Soc. 317, 55-64. 

Schlegel D J, Finkbeiner D P & Davis M 1998 Astrophys. J. 500, 525-553. 

Schmoldt I M, Saar V, Saha P, Branchini E, Efstathiou G P, Frenk C S, Keeble 
O, Maddox S, McMahon R, Oliver S, Rowan-Robinson M, Saunders W, 
Sutherland W J, Tadros H & White S D M 1999 Astron. J. 118, 1146-1160. 

Stoyan D, Kendall W & Mecke J 1995 Stochastic Geometry and its Applications 
John Wiley & Sons Chichester. 

Stoyan D & Stoyan H 2000 Scand. J. Satist. 27. 641-656. 

Szapudi 1, Colombi S & Bernardeau F 1999 Mon. Not. R. Astr. Soc. 310, 428- 
444. 

Tegmark M, Hamihon A J S, Strauss M A, Vogeley M S & Szalay A S 1998 

Astrophys. J. 499, 555-576. 
van Lieshout M N M & Baddeley A 1996 Stat. Neerlandica 50, 344. 
White S D M 1979 Mon. Not. R. Astr. Soc. 186, 145-154. 



